Effects of incorrect computer-aided detection (CAD) output on human decision-making in mammography
To investigate the effects of incorrect computer output on the reliability of the decisions of human users. This work followed an independent UK clinical trial that evaluated the impact of computer-aided detection (CAD) in breast screening. The aim was to use data from this trial to feed into probabilistic models (similar to those used in "reliability engineering") which would identify and assess possible ways of improving the human–CAD interaction. Some analyses required extra data; therefore, two supplementary studies were conducted. Study 1 was designed to elucidate the effects of computer failure on human performance. Study 2 was conducted to clarify unexpected findings from Study 1.
Conservative Confidence Bounds in Safety, from Generalised Claims of Improvement & Statistical Evidence
“Proven-in-use”, “globally-at-least-equivalent” and “stress-tested” are concepts that come up in diverse contexts in the acceptance, certification or licensing of critical systems. Their common feature is that dependability claims for a system in a certain operational environment are supported, in part, by evidence – viz. of successful operation – concerning different, though related, system[s] and/or environment[s], together with an auxiliary argument that the target system/environment offers the same, or improved, safety. We propose a formal probabilistic (Bayesian) organisation for these arguments. Through specific examples of evidence for the “improvement” argument above, we demonstrate scenarios in which formalising such arguments substantially increases confidence in the target system, and show why this is not always the case. Example scenarios concern vehicles and nuclear plants. Besides supporting stronger claims, the mathematical formalisation imposes precise statements of the bases for “improvement” claims: seemingly similar forms of prior beliefs are sometimes revealed to imply substantial differences in the claims they can support.
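One simple instance of how such a formalisation can transfer confidence (a sketch in notation of my choosing, not necessarily the paper's): if the “globally-at-least-equivalent” argument is accepted as certain, any posterior confidence in the source system carries over to the target.

```latex
% Sketch (notation mine): p_S and p_T are the unknown failure probabilities
% of the source and target system/environment pairs; E is the operational
% evidence about the source. Reading "globally at least equivalent" as
% P(p_T <= p_S) = 1, the event {p_S <= eps} implies {p_T <= eps}, hence:
\[
  P(p_T \le p_S) = 1
  \;\Longrightarrow\;
  P(p_T \le \varepsilon \mid E) \;\ge\; P(p_S \le \varepsilon \mid E)
  \quad \text{for every } \varepsilon > 0 .
\]
```

Weaker readings of “improvement” (probabilistic rather than certain dominance) yield correspondingly weaker inequalities, which is one way seemingly similar prior beliefs can end up supporting substantially different claims.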
Validation of Ultrahigh Dependability for Software-Based Systems
Modern society depends on computers for a number of critical tasks in which failure can have very high costs. As a consequence, high levels of dependability (reliability, safety, etc.) are required from such computers, including their software. Whenever a quantitative approach to risk is adopted, these requirements must be stated in quantitative terms, and a rigorous demonstration of their being attained is necessary. For software used in the most critical roles, such demonstrations are not usually supplied. The fact is that the dependability requirements often lie near the limit of the current state of the art, or beyond, in terms not only of the ability to satisfy them, but also, and more often, of the ability to demonstrate that they are satisfied in the individual operational products (validation). We discuss reasons why such demonstrations cannot usually be provided with the means available: reliability growth models, testing with stable reliability, structural dependability modelling, as well as more informal arguments based on good engineering practice. We state some rigorous arguments about the limits of what can be validated with each of these means. Combining evidence from these different sources would seem to raise the levels that can be validated; yet the improvement is not sufficient to solve the problem. It appears that engineering practice must take into account the fact that no solution exists, at present, for the validation of ultra-high dependability in systems relying on complex software.
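The scale of the problem with operational testing alone can be seen from a standard calculation (a back-of-the-envelope sketch, not taken from the paper):

```python
# Back-of-the-envelope sketch (not from the paper). If the true probability
# of failure on demand (pfd) exceeded p, the chance of observing n
# independent failure-free demands would be below (1 - p)^n. So rejecting
# "pfd > p" at confidence c requires (1 - p)^n <= 1 - c.
import math

def demands_needed(p: float, c: float) -> float:
    """Failure-free demands needed to claim pfd <= p at confidence c."""
    return math.log(1 - c) / math.log(1 - p)

for p in (1e-4, 1e-6, 1e-9):
    print(f"pfd <= {p:.0e} at 99% confidence: "
          f"~{demands_needed(p, 0.99):.1e} failure-free demands")

# pfd <= 1e-9 needs ~4.6e9 failure-free demands: at one demand per second,
# well over a century of continuous testing without a single failure.
```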
Modeling software design diversity
Design diversity has been used for many years now as a means of achieving a degree of fault tolerance in software-based systems. Whilst there is clear evidence that the approach can be expected to deliver some increase in reliability compared with a single version, there is no agreement about the extent of this increase. More importantly, it remains difficult to evaluate exactly how reliable a particular diverse fault-tolerant system is. This difficulty arises because assumptions of independence of failures between different versions have been shown not to be tenable: assessment of the actual level of dependence present is therefore needed, and this is hard. In this tutorial we survey the modelling issues here, with an emphasis upon the impact these have upon the problem of assessing the reliability of fault-tolerant systems. The intended audience comprises designers, assessors and project managers with only a basic knowledge of probability, as well as reliability experts without detailed knowledge of software, who seek an introduction to the probabilistic issues in decisions about design diversity.
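The untenability of the independence assumption has a standard explanation in “difficulty function” models of the Eckhardt–Lee kind (the sketch and notation below are mine, offered as context rather than as the tutorial's own derivation):

```latex
% Sketch (notation mine): theta(x) is the probability that a version
% developed at random fails on demand x; X is a randomly chosen demand.
% For two versions A and B developed independently by the same process:
\[
  P(A \text{ and } B \text{ both fail})
    = E\!\left[\theta(X)^2\right]
    = \bigl(E[\theta(X)]\bigr)^2 + \operatorname{Var}\bigl(\theta(X)\bigr)
    \;\ge\; \bigl(E[\theta(X)]\bigr)^2 .
\]
```

Unless every demand is equally “difficult” (theta constant), the variance term is strictly positive, so the naive product of the two pfds underestimates the probability of common failure even when the versions were developed independently.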
A conservative bound for the probability of failure of a 1-out-of-2 protection system with one hardware-only and one software-based protection train
Redundancy and diversity have long been used as means to obtain high reliability in critical systems. While it is easy to show that, say, a 1-out-of-2 diverse system will be more reliable than each of its two individual “trains”, assessing the actual reliability of such systems can be difficult because the trains cannot be assumed to fail independently. If we cannot claim independence of train failures, the computation of system reliability is difficult, because we would need to know the probability of failure on demand (pfd) for every possible demand. These are unlikely to be known in the case of software. Claims for software often concern its marginal pfd, i.e. its average across all possible demands. In this paper we consider the case of a 1-out-of-2 safety protection system in which one train contains software (and hardware), and the other train contains only hardware equipment. We show that a useful upper (i.e. conservative) bound can be obtained for the system pfd using only the unconditional pfd for software together with information about the variation of hardware failure probability across demands, which is likely to be known or estimable. The worst-case result is obtained by “allocating” software failure probability among demand “classes” so as to maximize system pfd.
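A sketch of the kind of worst-case “allocation” the abstract describes. The fractional-knapsack formulation, the within-class independence assumption and all numbers below are my reading and illustration, not necessarily the paper's exact construction:

```python
# Sketch (my reading of the abstract, not the paper's exact construction).
# Demands fall into classes i with known probabilities q[i] and known
# hardware-train pfds h[i]; the two trains are assumed to fail
# independently *within* a class; only the software's marginal pfd
# S = sum(q[i] * s[i]) is known. The worst case maximises
# sum(q[i] * h[i] * s[i]): a fractional knapsack, solved by loading the
# classes with the largest hardware pfd first.

def worst_case_system_pfd(q, h, S):
    """Conservative bound on the 1-out-of-2 system pfd, given marginal software pfd S."""
    budget = S                       # q-weighted software pfd left to place
    bound = 0.0
    for qi, hi in sorted(zip(q, h), key=lambda t: -t[1]):
        si = min(1.0, budget / qi)   # fill the worst class first
        bound += qi * hi * si
        budget -= qi * si
        if budget <= 0:
            break
    return bound

# Illustrative (made-up) demand profile: three classes.
q = [0.90, 0.09, 0.01]
h = [1e-5, 1e-4, 1e-2]
print(worst_case_system_pfd(q, h, S=1e-3))           # conservative bound
print(sum(qi * hi for qi, hi in zip(q, h)) * 1e-3)   # naive product, far smaller
```

In this illustration the conservative bound (1e-5) lies roughly two orders of magnitude above the naive product of the marginal pfds, which is why the allocation argument matters.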
CAD in mammography: lesion-level versus case-level analysis of the effects of prompts on human decisions
Objective: To understand decision processes in CAD-supported breast screening by analysing how prompts affect readers’ judgements of individual mammographic features (lesions). To this end we analysed hitherto unexamined details of reports completed by mammogram readers in an earlier evaluation of a CAD tool.
Material and methods: Assessments of lesions were extracted from 5,839 reports for 59 cancer cases. Statistical analyses of these data focused on what features readers considered when recalling a cancer case and how readers reacted to CAD prompts.
Results: About 13.5% of recall decisions were found to be caused by responses to features other than those indicating actual cancer. Effects of CAD: lesions were more likely to be examined if prompted; the presence of a prompt on a cancer increased the probability of both detection and recall, especially for less accurate readers and subtler cases; lack of prompts made cancer features less likely to be detected; false prompts made non-cancer features more likely to be classified as cancer.
Conclusion: The apparent lack of impact reported for CAD in some studies is plausibly due to CAD systematically affecting readers’ identification of individual features: beneficially for certain combinations of readers and features, and damagingly for others. Mammogram readers do not ignore prompts. Methodologically, assessing CAD by the number of recalled cancer cases may be misleading.
Modeling the probability of failure on demand (pfd) of a 1-out-of-2 system in which one channel is “quasi-perfect”
Our earlier work proposed ways of overcoming some of the difficulties caused by lack of independence in reliability modeling of 1-out-of-2 software-based systems. Firstly, it is well known that aleatory independence between the failures of two channels A and B cannot be assumed, so the system pfd is not a simple product of the channel pfds. However, it has been shown that the probability of system failure can be bounded conservatively by a simple product of pfd_A and pnp_B (the probability that channel B is not perfect) in those special cases where channel B is sufficiently simple to be possibly perfect. Whilst this “solves” the problem of aleatory dependence, the issue of epistemic dependence remains: an assessor’s beliefs about the unknown pfd_A and pnp_B will not treat them as independent. Recent work has partially overcome this problem by requiring only marginal beliefs – at the price of further conservatism. Here we generalize these results. Instead of “perfection” we introduce the notion of “quasi-perfection”: a pfd so small as to be practically equivalent to perfection (e.g. yielding a very small chance of failure over the entire life of a fleet of systems). We present a conservative argument supporting claims about system pfd. We propose further work, e.g. “what if?” calculations to understand exactly how conservative our approach might be in practice, and suggest further simplifications.
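For context, the earlier “possible perfection” bound that this paper generalises can be sketched as follows (the conditional-independence step is a simplifying assumption of this sketch):

```latex
% Sketch: F_A, F_B are the events that channels A and B fail on a random
% demand; NP_B is the event that B is not perfect. A perfect channel never
% fails, so F_B implies NP_B. If A's failing on a demand is independent of
% whether B's development produced a perfect program, then:
\[
  P(\text{system fails})
    = P(F_A \cap F_B)
    \;\le\; P(F_A \cap NP_B)
    = P(F_A \mid NP_B)\, P(NP_B)
    = \mathit{pfd}_A \cdot \mathit{pnp}_B .
\]
```

“Quasi-perfection” replaces the perfection event with a small-pfd event, so the same style of argument can be run without claiming that any real program is literally free of faults.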
Assessing Safety-Critical Systems from Operational Testing: A Study on Autonomous Vehicles
Context: Demonstrating high reliability and safety for safety-critical systems (SCSs) remains a hard problem. Diverse evidence needs to be combined in a rigorous way: in particular, results of operational testing with other evidence from design and verification. Growing use of machine learning in SCSs, by precluding most established methods for gaining assurance, makes evidence from operational testing even more important for supporting safety and reliability claims.
Objective: We revisit the problem of using operational testing to demonstrate high reliability. We use Autonomous Vehicles (AVs) as a current example. AVs are making their debut on public roads: methods for assessing whether an AV is safe enough are urgently needed. We demonstrate how to answer 5 questions that would arise in assessing an AV type, starting with those proposed by a highly-cited study.
Method: We apply new theorems extending our Conservative Bayesian Inference (CBI) approach, which exploit the rigour of Bayesian methods while reducing the risk of involuntary misuse associated (we argue) with now-common applications of Bayesian inference; we define additional conditions needed for applying these methods to AVs.
Results: Prior knowledge can bring substantial advantages if the AV design allows strong expectations of safety before road testing. We also show how naive attempts at conservative assessment may instead lead to over-optimism; why extrapolating the trend of disengagements (take-overs by human drivers) is not suitable for safety claims; and how to use knowledge that an AV has moved to a “less stressful” environment.
Conclusion: While some reliability targets will remain too high to be practically verifiable, our CBI approach removes a major source of doubt: it allows use of prior knowledge without inducing dangerously optimistic biases. For certain ranges of required reliability and prior beliefs, CBI thus supports feasible, sound arguments. Useful conservative claims can be derived from limited prior knowledge.
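For a sense of the scale behind “too high to be practically verifiable”, here is the standard frequentist calculation of the kind made by the highly-cited study mentioned above (this is not the paper's CBI machinery, and the benchmark fatality rate is an illustrative assumption):

```python
# Standard frequentist sketch (not the paper's CBI machinery). Treating
# fatal crashes as a Poisson process with rate r per mile, m fatality-free
# miles occur with probability exp(-r * m) if the true rate is r; so
# claiming "rate < r" at confidence c requires m >= -ln(1 - c) / r.
import math

def miles_needed(r_per_mile: float, confidence: float) -> float:
    """Fatality-free miles needed to claim a fatality rate below r_per_mile."""
    return -math.log(1 - confidence) / r_per_mile

human_rate = 1.1e-8   # assumed benchmark: ~1.1 fatalities per 100 million miles
print(f"{miles_needed(human_rate, 0.95):.2e} fatality-free miles")

# ~2.7e8: hundreds of millions of fatality-free miles merely to match an
# assumed human-driver rate at 95% confidence, before claiming improvement.
```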